Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 41188 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 12 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 6.6 MiB |
| Average record size in memory | 168.0 B |
Variable types
| NUM | 10 |
|---|---|
| CAT | 10 |
| BOOL | 1 |
| Dataset has 12 (< 0.1%) duplicate rows | Duplicates |
euribor3m is highly correlated with emp.var.rate and 1 other fields | High correlation |
emp.var.rate is highly correlated with euribor3m and 1 other fields | High correlation |
nr.employed is highly correlated with emp.var.rate and 1 other fields | High correlation |
previous has 35563 (86.3%) zeros | Zeros |
Reproduction
| Analysis started | 2021-09-13 20:09:03.295935 |
|---|---|
| Analysis finished | 2021-09-13 20:09:31.351576 |
| Duration | 28.06 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
age
Real number (ℝ≥0)
| Distinct | 78 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.02406041 |
|---|---|
| Minimum | 17 |
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 32 |
| median | 38 |
| Q3 | 47 |
| 95-th percentile | 58 |
| Maximum | 98 |
| Range | 81 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.42124998 |
|---|---|
| Coefficient of variation (CV) | 0.2603746315 |
| Kurtosis | 0.7913115312 |
| Mean | 40.02406041 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7846968158 |
| Sum | 1648511 |
| Variance | 108.6024512 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 31 | 1947 | 4.7% | |
| 32 | 1846 | 4.5% | |
| 33 | 1833 | 4.5% | |
| 36 | 1780 | 4.3% | |
| 35 | 1759 | 4.3% | |
| 34 | 1745 | 4.2% | |
| 30 | 1714 | 4.2% | |
| 37 | 1475 | 3.6% | |
| 29 | 1453 | 3.5% | |
| 39 | 1432 | 3.5% | |
| Other values (68) | 24204 | 58.8% |
| Value | Count | Frequency (%) | |
| 17 | 5 | < 0.1% | |
| 18 | 28 | 0.1% | |
| 19 | 42 | 0.1% | |
| 20 | 65 | 0.2% | |
| 21 | 102 | 0.2% |
| Value | Count | Frequency (%) | |
| 98 | 2 | < 0.1% | |
| 95 | 1 | < 0.1% | |
| 94 | 1 | < 0.1% | |
| 92 | 4 | < 0.1% | |
| 91 | 2 | < 0.1% |
job
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| admin. | |
|---|---|
| blue-collar | |
| technician | |
| services | |
| management | |
| Other values (7) |
| Value | Count | Frequency (%) | |
| admin. | 10422 | 25.3% | |
| blue-collar | 9254 | 22.5% | |
| technician | 6743 | 16.4% | |
| services | 3969 | 9.6% | |
| management | 2924 | 7.1% | |
| retired | 1720 | 4.2% | |
| entrepreneur | 1456 | 3.5% | |
| self-employed | 1421 | 3.5% | |
| housemaid | 1060 | 2.6% | |
| unemployed | 1014 | 2.5% | |
| Other values (2) | 1205 | 2.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.955229679 |
| Min length | 6 |
marital
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| married | |
|---|---|
| single | |
| divorced | |
| unknown | 80 |
| Value | Count | Frequency (%) | |
| married | 24928 | 60.5% | |
| single | 11568 | 28.1% | |
| divorced | 4612 | 11.2% | |
| unknown | 80 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.831115859 |
| Min length | 6 |
education
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| university.degree | |
|---|---|
| high.school | |
| basic.9y | |
| professional.course | |
| basic.4y | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| university.degree | 12168 | 29.5% | |
| high.school | 9515 | 23.1% | |
| basic.9y | 6045 | 14.7% | |
| professional.course | 5243 | 12.7% | |
| basic.4y | 4176 | 10.1% | |
| basic.6y | 2292 | 5.6% | |
| unknown | 1731 | 4.2% | |
| illiterate | 18 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 19 |
|---|---|
| Median length | 11 |
| Mean length | 12.7109595 |
| Min length | 7 |
default
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| no | |
|---|---|
| unknown | |
| yes | 3 |
| Value | Count | Frequency (%) | |
| no | 32588 | 79.1% | |
| unknown | 8597 | 20.9% | |
| yes | 3 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 3.043702049 |
| Min length | 2 |
housing
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| yes | |
|---|---|
| no | |
| unknown | 990 |
| Value | Count | Frequency (%) | |
| yes | 21576 | 52.4% | |
| no | 18622 | 45.2% | |
| unknown | 990 | 2.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 2.644022531 |
| Min length | 2 |
loan
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| no | |
|---|---|
| yes | |
| unknown | 990 |
| Value | Count | Frequency (%) | |
| no | 33950 | 82.4% | |
| yes | 6248 | 15.2% | |
| unknown | 990 | 2.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.271875303 |
| Min length | 2 |
contact
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| cellular | |
|---|---|
| telephone |
| Value | Count | Frequency (%) | |
| cellular | 26144 | 63.5% | |
| telephone | 15044 | 36.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.365252015 |
| Min length | 8 |
month
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| may | |
|---|---|
| jul | |
| aug | |
| jun | |
| nov | |
| Other values (5) |
| Value | Count | Frequency (%) | |
| may | 13769 | 33.4% | |
| jul | 7174 | 17.4% | |
| aug | 6178 | 15.0% | |
| jun | 5318 | 12.9% | |
| nov | 4101 | 10.0% | |
| apr | 2632 | 6.4% | |
| oct | 718 | 1.7% | |
| sep | 570 | 1.4% | |
| mar | 546 | 1.3% | |
| dec | 182 | 0.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
day_of_week
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| thu | |
|---|---|
| mon | |
| wed | |
| tue | |
| fri |
| Value | Count | Frequency (%) | |
| thu | 8623 | 20.9% | |
| mon | 8514 | 20.7% | |
| wed | 8134 | 19.7% | |
| tue | 8090 | 19.6% | |
| fri | 7827 | 19.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
duration
Real number (ℝ≥0)
| Distinct | 1544 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 258.2850102 |
|---|---|
| Minimum | 0 |
| Maximum | 4918 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 102 |
| median | 180 |
| Q3 | 319 |
| 95-th percentile | 752.65 |
| Maximum | 4918 |
| Range | 4918 |
| Interquartile range (IQR) | 217 |
Descriptive statistics
| Standard deviation | 259.2792488 |
|---|---|
| Coefficient of variation (CV) | 1.003849386 |
| Kurtosis | 20.24793801 |
| Mean | 258.2850102 |
| Median Absolute Deviation (MAD) | 94 |
| Skewness | 3.263141255 |
| Sum | 10638243 |
| Variance | 67225.72888 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 85 | 170 | 0.4% | |
| 90 | 170 | 0.4% | |
| 136 | 168 | 0.4% | |
| 73 | 167 | 0.4% | |
| 124 | 164 | 0.4% | |
| 87 | 162 | 0.4% | |
| 104 | 161 | 0.4% | |
| 72 | 161 | 0.4% | |
| 111 | 160 | 0.4% | |
| 106 | 159 | 0.4% | |
| Other values (1534) | 39546 | 96.0% |
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 1 | 3 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 3 | < 0.1% | |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4918 | 1 | < 0.1% | |
| 4199 | 1 | < 0.1% | |
| 3785 | 1 | < 0.1% | |
| 3643 | 1 | < 0.1% | |
| 3631 | 1 | < 0.1% |
campaign
Real number (ℝ≥0)
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.567592503 |
|---|---|
| Minimum | 1 |
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 56 |
| Range | 55 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.770013543 |
|---|---|
| Coefficient of variation (CV) | 1.078836903 |
| Kurtosis | 36.97979514 |
| Mean | 2.567592503 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.762506697 |
| Sum | 105754 |
| Variance | 7.672975028 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 17642 | 42.8% | |
| 2 | 10570 | 25.7% | |
| 3 | 5341 | 13.0% | |
| 4 | 2651 | 6.4% | |
| 5 | 1599 | 3.9% | |
| 6 | 979 | 2.4% | |
| 7 | 629 | 1.5% | |
| 8 | 400 | 1.0% | |
| 9 | 283 | 0.7% | |
| 10 | 225 | 0.5% | |
| Other values (32) | 869 | 2.1% |
| Value | Count | Frequency (%) | |
| 1 | 17642 | 42.8% | |
| 2 | 10570 | 25.7% | |
| 3 | 5341 | 13.0% | |
| 4 | 2651 | 6.4% | |
| 5 | 1599 | 3.9% |
| Value | Count | Frequency (%) | |
| 56 | 1 | < 0.1% | |
| 43 | 2 | < 0.1% | |
| 42 | 2 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 40 | 2 | < 0.1% |
pdays
Real number (ℝ≥0)
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 962.475454 |
|---|---|
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 15 |
| Zeros (%) | < 0.1% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 999 |
| Q1 | 999 |
| median | 999 |
| Q3 | 999 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 186.9109073 |
|---|---|
| Coefficient of variation (CV) | 0.194198103 |
| Kurtosis | 22.22946263 |
| Mean | 962.475454 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.922189916 |
| Sum | 39642439 |
| Variance | 34935.68728 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 999 | 39673 | 96.3% | |
| 3 | 439 | 1.1% | |
| 6 | 412 | 1.0% | |
| 4 | 118 | 0.3% | |
| 9 | 64 | 0.2% | |
| 2 | 61 | 0.1% | |
| 7 | 60 | 0.1% | |
| 12 | 58 | 0.1% | |
| 10 | 52 | 0.1% | |
| 5 | 46 | 0.1% | |
| Other values (17) | 205 | 0.5% |
| Value | Count | Frequency (%) | |
| 0 | 15 | < 0.1% | |
| 1 | 26 | 0.1% | |
| 2 | 61 | 0.1% | |
| 3 | 439 | 1.1% | |
| 4 | 118 | 0.3% |
| Value | Count | Frequency (%) | |
| 999 | 39673 | 96.3% | |
| 27 | 1 | < 0.1% | |
| 26 | 1 | < 0.1% | |
| 25 | 1 | < 0.1% | |
| 22 | 3 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1729629989 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 35563 |
| Zeros (%) | 86.3% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4949010798 |
|---|---|
| Coefficient of variation (CV) | 2.861311858 |
| Kurtosis | 20.10881622 |
| Mean | 0.1729629989 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.832042243 |
| Sum | 7124 |
| Variance | 0.2449270788 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 35563 | 86.3% | |
| 1 | 4561 | 11.1% | |
| 2 | 754 | 1.8% | |
| 3 | 216 | 0.5% | |
| 4 | 70 | 0.2% | |
| 5 | 18 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 35563 | 86.3% | |
| 1 | 4561 | 11.1% | |
| 2 | 754 | 1.8% | |
| 3 | 216 | 0.5% | |
| 4 | 70 | 0.2% |
| Value | Count | Frequency (%) | |
| 7 | 1 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 5 | 18 | < 0.1% | |
| 4 | 70 | 0.2% | |
| 3 | 216 | 0.5% |
poutcome
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| nonexistent | |
|---|---|
| failure | |
| success | 1373 |
| Value | Count | Frequency (%) | |
| nonexistent | 35563 | 86.3% | |
| failure | 4252 | 10.3% | |
| success | 1373 | 3.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.45372439 |
| Min length | 7 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08188550063 |
|---|---|
| Minimum | -3.4 |
| Maximum | 1.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | -3.4 |
|---|---|
| 5-th percentile | -2.9 |
| Q1 | -1.8 |
| median | 1.1 |
| Q3 | 1.4 |
| 95-th percentile | 1.4 |
| Maximum | 1.4 |
| Range | 4.8 |
| Interquartile range (IQR) | 3.2 |
Descriptive statistics
| Standard deviation | 1.570959741 |
|---|---|
| Coefficient of variation (CV) | 19.18483405 |
| Kurtosis | -1.062631525 |
| Mean | 0.08188550063 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.7240955492 |
| Sum | 3372.7 |
| Variance | 2.467914506 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.4 | 16234 | 39.4% | |
| -1.8 | 9184 | 22.3% | |
| 1.1 | 7763 | 18.8% | |
| -0.1 | 3683 | 8.9% | |
| -2.9 | 1663 | 4.0% | |
| -3.4 | 1071 | 2.6% | |
| -1.7 | 773 | 1.9% | |
| -1.1 | 635 | 1.5% | |
| -3 | 172 | 0.4% | |
| -0.2 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| -3.4 | 1071 | 2.6% | |
| -3 | 172 | 0.4% | |
| -2.9 | 1663 | 4.0% | |
| -1.8 | 9184 | 22.3% | |
| -1.7 | 773 | 1.9% |
| Value | Count | Frequency (%) | |
| 1.4 | 16234 | 39.4% | |
| 1.1 | 7763 | 18.8% | |
| -0.1 | 3683 | 8.9% | |
| -0.2 | 10 | < 0.1% | |
| -1.1 | 635 | 1.5% |
cons.price.idx
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.57566437 |
|---|---|
| Minimum | 92.201 |
| Maximum | 94.767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 92.201 |
|---|---|
| 5-th percentile | 92.713 |
| Q1 | 93.075 |
| median | 93.749 |
| Q3 | 93.994 |
| 95-th percentile | 94.465 |
| Maximum | 94.767 |
| Range | 2.566 |
| Interquartile range (IQR) | 0.919 |
Descriptive statistics
| Standard deviation | 0.578840049 |
|---|---|
| Coefficient of variation (CV) | 0.00618579684 |
| Kurtosis | -0.8298085772 |
| Mean | 93.57566437 |
| Median Absolute Deviation (MAD) | 0.38 |
| Skewness | -0.2308876514 |
| Sum | 3854194.464 |
| Variance | 0.3350558023 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 93.994 | 7763 | 18.8% | |
| 93.918 | 6685 | 16.2% | |
| 92.893 | 5794 | 14.1% | |
| 93.444 | 5175 | 12.6% | |
| 94.465 | 4374 | 10.6% | |
| 93.2 | 3616 | 8.8% | |
| 93.075 | 2458 | 6.0% | |
| 92.201 | 770 | 1.9% | |
| 92.963 | 715 | 1.7% | |
| 92.431 | 447 | 1.1% | |
| Other values (16) | 3391 | 8.2% |
| Value | Count | Frequency (%) | |
| 92.201 | 770 | 1.9% | |
| 92.379 | 267 | 0.6% | |
| 92.431 | 447 | 1.1% | |
| 92.469 | 178 | 0.4% | |
| 92.649 | 357 | 0.9% |
| Value | Count | Frequency (%) | |
| 94.767 | 128 | 0.3% | |
| 94.601 | 204 | 0.5% | |
| 94.465 | 4374 | 10.6% | |
| 94.215 | 311 | 0.8% | |
| 94.199 | 303 | 0.7% |
cons.conf.idx
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -40.50260027 |
|---|---|
| Minimum | -50.8 |
| Maximum | -26.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | -50.8 |
|---|---|
| 5-th percentile | -47.1 |
| Q1 | -42.7 |
| median | -41.8 |
| Q3 | -36.4 |
| 95-th percentile | -33.6 |
| Maximum | -26.9 |
| Range | 23.9 |
| Interquartile range (IQR) | 6.3 |
Descriptive statistics
| Standard deviation | 4.628197856 |
|---|---|
| Coefficient of variation (CV) | -0.1142691537 |
| Kurtosis | -0.3585583105 |
| Mean | -40.50260027 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 0.3031798587 |
| Sum | -1668221.1 |
| Variance | 21.4202154 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| -36.4 | 7763 | 18.8% | |
| -42.7 | 6685 | 16.2% | |
| -46.2 | 5794 | 14.1% | |
| -36.1 | 5175 | 12.6% | |
| -41.8 | 4374 | 10.6% | |
| -42 | 3616 | 8.8% | |
| -47.1 | 2458 | 6.0% | |
| -31.4 | 770 | 1.9% | |
| -40.8 | 715 | 1.7% | |
| -26.9 | 447 | 1.1% | |
| Other values (16) | 3391 | 8.2% |
| Value | Count | Frequency (%) | |
| -50.8 | 128 | 0.3% | |
| -50 | 282 | 0.7% | |
| -49.5 | 204 | 0.5% | |
| -47.1 | 2458 | 6.0% | |
| -46.2 | 5794 | 14.1% |
| Value | Count | Frequency (%) | |
| -26.9 | 447 | 1.1% | |
| -29.8 | 267 | 0.6% | |
| -30.1 | 357 | 0.9% | |
| -31.4 | 770 | 1.9% | |
| -33 | 172 | 0.4% |
| Distinct | 316 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.621290813 |
|---|---|
| Minimum | 0.634 |
| Maximum | 5.045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0.634 |
|---|---|
| 5-th percentile | 0.797 |
| Q1 | 1.344 |
| median | 4.857 |
| Q3 | 4.961 |
| 95-th percentile | 4.966 |
| Maximum | 5.045 |
| Range | 4.411 |
| Interquartile range (IQR) | 3.617 |
Descriptive statistics
| Standard deviation | 1.734447405 |
|---|---|
| Coefficient of variation (CV) | 0.4789583313 |
| Kurtosis | -1.406802622 |
| Mean | 3.621290813 |
| Median Absolute Deviation (MAD) | 0.108 |
| Skewness | -0.7091879564 |
| Sum | 149153.726 |
| Variance | 3.0083078 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4.857 | 2868 | 7.0% | |
| 4.962 | 2613 | 6.3% | |
| 4.963 | 2487 | 6.0% | |
| 4.961 | 1902 | 4.6% | |
| 4.856 | 1210 | 2.9% | |
| 4.964 | 1175 | 2.9% | |
| 1.405 | 1169 | 2.8% | |
| 4.965 | 1071 | 2.6% | |
| 4.864 | 1044 | 2.5% | |
| 4.96 | 1013 | 2.5% | |
| Other values (306) | 24636 | 59.8% |
| Value | Count | Frequency (%) | |
| 0.634 | 8 | < 0.1% | |
| 0.635 | 43 | 0.1% | |
| 0.636 | 14 | < 0.1% | |
| 0.637 | 6 | < 0.1% | |
| 0.638 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5.045 | 9 | < 0.1% | |
| 5 | 7 | < 0.1% | |
| 4.97 | 172 | 0.4% | |
| 4.968 | 992 | 2.4% | |
| 4.967 | 643 | 1.6% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5167.035911 |
|---|---|
| Minimum | 4963.6 |
| Maximum | 5228.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 4963.6 |
|---|---|
| 5-th percentile | 5017.5 |
| Q1 | 5099.1 |
| median | 5191 |
| Q3 | 5228.1 |
| 95-th percentile | 5228.1 |
| Maximum | 5228.1 |
| Range | 264.5 |
| Interquartile range (IQR) | 129 |
Descriptive statistics
| Standard deviation | 72.25152767 |
|---|---|
| Coefficient of variation (CV) | 0.01398316732 |
| Kurtosis | -0.003760375696 |
| Mean | 5167.035911 |
| Median Absolute Deviation (MAD) | 37.1 |
| Skewness | -1.044262407 |
| Sum | 212819875.1 |
| Variance | 5220.28325 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5228.1 | 16234 | 39.4% | |
| 5099.1 | 8534 | 20.7% | |
| 5191 | 7763 | 18.8% | |
| 5195.8 | 3683 | 8.9% | |
| 5076.2 | 1663 | 4.0% | |
| 5017.5 | 1071 | 2.6% | |
| 4991.6 | 773 | 1.9% | |
| 5008.7 | 650 | 1.6% | |
| 4963.6 | 635 | 1.5% | |
| 5023.5 | 172 | 0.4% |
| Value | Count | Frequency (%) | |
| 4963.6 | 635 | 1.5% | |
| 4991.6 | 773 | 1.9% | |
| 5008.7 | 650 | 1.6% | |
| 5017.5 | 1071 | 2.6% | |
| 5023.5 | 172 | 0.4% |
| Value | Count | Frequency (%) | |
| 5228.1 | 16234 | 39.4% | |
| 5195.8 | 3683 | 8.9% | |
| 5191 | 7763 | 18.8% | |
| 5176.3 | 10 | < 0.1% | |
| 5099.1 | 8534 | 20.7% |
y
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| no | |
|---|---|
| yes |
| Value | Count | Frequency (%) | |
| no | 36548 | 88.7% | |
| yes | 4640 | 11.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 56 | housemaid | married | basic.4y | no | no | no | telephone | may | mon | 261 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 1 | 57 | services | married | high.school | unknown | no | no | telephone | may | mon | 149 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 2 | 37 | services | married | high.school | no | yes | no | telephone | may | mon | 226 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 3 | 40 | admin. | married | basic.6y | no | no | no | telephone | may | mon | 151 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 4 | 56 | services | married | high.school | no | no | yes | telephone | may | mon | 307 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 5 | 45 | services | married | basic.9y | unknown | no | no | telephone | may | mon | 198 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 6 | 59 | admin. | married | professional.course | no | no | no | telephone | may | mon | 139 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 7 | 41 | blue-collar | married | unknown | unknown | no | no | telephone | may | mon | 217 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 8 | 24 | technician | single | professional.course | no | yes | no | telephone | may | mon | 380 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 9 | 25 | services | single | high.school | no | yes | no | telephone | may | mon | 50 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
Last rows
| age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41178 | 62 | retired | married | university.degree | no | no | no | cellular | nov | thu | 483 | 2 | 6 | 3 | success | -1.1 | 94.767 | -50.8 | 1.031 | 4963.6 | yes |
| 41179 | 64 | retired | divorced | professional.course | no | yes | no | cellular | nov | fri | 151 | 3 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41180 | 36 | admin. | married | university.degree | no | no | no | cellular | nov | fri | 254 | 2 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41181 | 37 | admin. | married | university.degree | no | yes | no | cellular | nov | fri | 281 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | yes |
| 41182 | 29 | unemployed | single | basic.4y | no | yes | no | cellular | nov | fri | 112 | 1 | 9 | 1 | success | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41183 | 73 | retired | married | professional.course | no | yes | no | cellular | nov | fri | 334 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | yes |
| 41184 | 46 | blue-collar | married | professional.course | no | no | no | cellular | nov | fri | 383 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41185 | 56 | retired | married | university.degree | no | yes | no | cellular | nov | fri | 189 | 2 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41186 | 44 | technician | married | professional.course | no | no | no | cellular | nov | fri | 442 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | yes |
| 41187 | 74 | retired | married | professional.course | no | yes | no | cellular | nov | fri | 239 | 3 | 999 | 1 | failure | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
Most frequent
| age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 24 | services | single | high.school | no | yes | no | cellular | apr | tue | 114 | 1 | 999 | 0 | nonexistent | -1.8 | 93.075 | -47.1 | 1.423 | 5099.1 | no | 2 |
| 1 | 27 | technician | single | professional.course | no | no | no | cellular | jul | mon | 331 | 2 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.962 | 5228.1 | no | 2 |
| 2 | 32 | technician | single | professional.course | no | yes | no | cellular | jul | thu | 128 | 1 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.968 | 5228.1 | no | 2 |
| 3 | 35 | admin. | married | university.degree | no | yes | no | cellular | may | fri | 348 | 4 | 999 | 0 | nonexistent | -1.8 | 92.893 | -46.2 | 1.313 | 5099.1 | no | 2 |
| 4 | 36 | retired | married | unknown | no | no | no | telephone | jul | thu | 88 | 1 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.966 | 5228.1 | no | 2 |
| 5 | 39 | admin. | married | university.degree | no | no | no | cellular | nov | tue | 123 | 2 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.153 | 5195.8 | no | 2 |
| 6 | 39 | blue-collar | married | basic.6y | no | no | no | telephone | may | thu | 124 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.855 | 5191.0 | no | 2 |
| 7 | 41 | technician | married | professional.course | no | yes | no | cellular | aug | tue | 127 | 1 | 999 | 0 | nonexistent | 1.4 | 93.444 | -36.1 | 4.966 | 5228.1 | no | 2 |
| 8 | 45 | admin. | married | university.degree | no | no | no | cellular | jul | thu | 252 | 1 | 999 | 0 | nonexistent | -2.9 | 92.469 | -33.6 | 1.072 | 5076.2 | yes | 2 |
| 9 | 47 | technician | divorced | high.school | no | yes | no | cellular | jul | thu | 43 | 3 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.962 | 5228.1 | no | 2 |